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Abstract. Let U be a Haar distributed matrix in U(n) or O(n). In 
a previous paper, we proved that after centering, the two-parameter 
process 

T W (M)= E l^'l 2 

converges in distribution to the bivariate tied-down Brownian bridge. 
In the present paper, we replace the deterministic truncation of U by 
a random one, where each row (resp. column) is chosen with prob- 
ability s (resp. t) independently. We prove that the corresponding 
two-parameter process, after centering and normalization by n" 1 / 2 con- 
verges to a Gaussian process. On the way we meet other interesting 
convergences. 

1. Introduction 

Let us consider a unitary matrix U of size n x n. We fix two integers 
p < n and q < n and delete deterministically n — p rows and n — q columns. 
Let us call U p ' q the (rectangular) matrix so obtained. It is well known 
that if U is Haar distributed in U(n), the random matrix U p ' q (U p ' q )* has 
a Jacobi matricial distribution and that if (p/n,q/n) — >• (s,t) £ (0, l) 2 , its 
empirical spectral distribution converges to a limit T> s ^ (see for instance [5]), 
often called the generalized Kesten-McKay distribution. It is clear from the 
invariance of the Haar distribution on U(n) that we can keep the first p rows 
and the first q columns. 

In [TU] we studied the trace of U p ' q (U p ' q )* . It is the squared of the Frobe- 
nius (or Euclidean) norm of U p ' q . Actually we set p = \ns\,q = [nt\ and 
considered the process indexed by s, t 6 [0, 1]. We proved that, after center- 
ing, but without any normalization, the process converges in distribution, 
as n — > oo to a bivariate tied-down Brownian bridge. Previously, Chapuy [4] 
proved a similar result for permutation matrices, with a n" 1 / 2 normalization. 

Besides, for purposes of random geometry analysis, in [13] (see also [12J) 
B. Farrell deletes randomly and independently a proportion 1 — s of rows and 
a proportion 1 — t of columns. Let us call lA s,t the matrix so obtained. Farrell 
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proved that (for fixed s, t) the empirical spectral distribution of U s ' 1 (U s,t ) 
has an explicit limiting distribution which is precisely T> s ,t- Actually, Farrell 
considered first the (deterministic) DFT matrix 

and proved that a Haar unitary matrix has the same behaviour. 

It is then tempting to study the same statistic as above, when using this 
random truncation. Of course, instead of the DFT, we can as well consider 
any (random or not) matrix whose all elements are of modulus n _1//2 , for 
instance a (normalized) complex Hadamard matrix. 

To define all random truncations simultaneously and get a two-parameter 
process, we define a double array of n 2 auxiliary uniform variables, in order 
to generate Bernoulli variables. We will prove below that after centering, 
we need a normalization to get a Gaussian limiting process. 

We use the Skorokhod space D([0, l] 2 ). It consists of functions from 
[0, l] 2 to M which are at each point right continuous (with respect to the 
natural partial order of [0, l] 2 ) and admit limits in all "orthants". The 
space D([0, l] 2 ) is endowed with the topology of Skorokhod (see [3] for the 
definition) . 

For the sake of completeness, we treat also the one-parameter process, 
i.e. truncation of the first column of the unitary matrix, and the case of 
permutation matrices. 

In Farell [13] is also mentioned another way of truncation, used in [18J, 
consisting in drawing only one array of n Bernoulli variables to determine 
the choice of rows and columns. We did not consider this model here, to 
make the paper shorter. 

The rest of the paper is organized as follows. In Sec. 2 we introduce the 
basic notations with the main statistics and the list of limiting processes. 
In Sec. 3 we present the statements of convergence in distribution for all 
models. In particuler Theorem 13.31 savs that at the first order, the difference 
between DFT, Haar unitary or Haar orthogonal random matrices is not 
seen by the statistics 7^ n ' at the limit since only the randomness of the 
truncation contributes. Besides, Proposition 13.51 savs that this difference is 
seen at the second order, by means of the process Z^ n \ Sec. 4 is devoted 
to the proofs. The most delicate part is the proof of Prop. 13.51 (2). We first 
prove the convergence of finite dimensional distributions after replacement 
of the indicator variables by Gaussian ones (Lindeberg's strategy). Then we 
prove tightness of the process rescaled by ra _1//2 by application of Davydov 
and Zitikis's criterion [8j. It could be noted that before that, we tried 
several tightness critera for the unsealed (two-parameter) process, such as 
Bickel and Wichura [3] or Ivanoff [2], but they failecQ. In Sec. 6, we gather 



We hope to address a definite answer to the question of weak convergence of Z^"' in 
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all the estimates of moments; i.e of polynomial integrals on U(n) and O(n). 
Finally in Sec. 7, we give the proof of the Lindeberg's strategy. 

2. Notations 

We introduce the random processes that we will consider in this paper 
and the various limiting processes involved. 

2.1. The main statistics. Let U G U(n) be the unitary group of size n 
and let Uij be the generic element of U. Let now = 1, ■ ■ ■ , n and 
Cj,j = l,---,nbe two independent families of independent random vari- 
ables uniformly distributed on [0, 1]. 

For the one-parameter model, we introduce two processses. 

/M \ 
B [ o ] = \y2{\U a \ 2 -l/n) , s e [0, 1] 1 , (2.1) 

= (E W (!*<• - s ) . s G [°. • ( 2 - 2 ) 
For the two parameters model, we introduce the same framework. Let 

i< \ns\ ,j< \nt\ 

and 

r<»> = (t£W € [o, i]) . 

Let now 



:= E l^flfli^lc^t • (2-4) 
We have clearly, since \Uij\ 2 and (Ri,Cj) are independent, 

r w _ E7 >) = £ f _ st] . (2 . 5) 

Introducing the centering of Bernoulli variables, and using the fact that the 
matrix U is unitary, we may write 

r H_ ET H =z (n) +nl/2w H m 

where 

n 

Z { $ ■■= I^«f [!*<.-'] [!<*<*-*] (2-7) 
wj? := s F t (n) +tG£ n > (2.8) 
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with the empirical processes 

n n 

Ft ] := n-W £[l Cj < t ~ t] ; := n~V 2 - a] . (2.9) 

j=l i=l 

In the sequel, we denote, 

= (7#U t € [0, 1]) , 2W = (2 s ( J, Sl t G [0, 1]) , 
FW = (jf B) ,i€[0,l]) , G(") = (G; n) ,tG[0,l]), 

and 

w(") = (wif, s ,*€[o,i]). 

2.2. Gaussian processes and bridges. The classical Brownian bridge 
denoted by Bq is a centered Gaussian process with continuous paths defined 
on [0, 1] of covariance 

E (Bq(s)Bq(s')) = a A a' - ss' . 

The bivariate Brownian bridge denoted by Bqq is a centered Gaussian pro- 
cess with continuous paths defined on [0, l] 2 of covariance 

E (B ,o(s, t)B , (s', t'))) = (a A s')(t A t') - ss'tt' . 

The tied-down bivariate Brownian bridge denoted by is a centered 

Gaussian process with continuous paths defined on [0, l] 2 of covariance 

E[W {oo \s,t)W {QO \s',t')} = (sAs'-ss')(t M'-tt'). 

Let also W^ 00 -* be the centered Gaussian process with continuous paths de- 
fined on [0, l] 2 of covariance 

E[W (oo) (a, t) W (oo) (a', t')] = aa'(t A t') + (a A a')tt' - 2aa'tt'. 

It can be defined also as 

W (oo) (a,t) = sB (t) + tB' (s) 

where Bq and B' two independent one parameter Brownian bridges. 

At last we will meet the process denoted by Bq (g> Bq which is a centered 
process with continuous paths defined on [0, l] 2 by 

Bo®Bo{s,t) = Bo{s)B {t) 

where Bq and B' are two independent Brownian bridges. This process is 
not Gaussian, but it has the same covariance as 
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3. Convergence in distribution 

We present unfied results in the cases of unitary and orthogonal groups. 
For this purpose we use the classical notation 

z (3 I 1/2 in the orthogonal case, 
2 1 1 in the unitary case. 

Let us begin with the one-parameter processes, where means conver- 
gence in distribution in D([0,1)). 

Theorem 3.1. We have : 

^B(») ^ Bo , (3.1) 
V^B^ ^ ^/TTW^Bo. (3.2) 



(|3.ip is well known since at least Silverstein [17] (in the case j3' = 1). 
Both results are a direct consequence of the fact that the vector (\Un\ 2 ,i = 
1, . . . , n) follows the Dirichlet (/?',..., f3') distribution on the simplex (see a 
detailed proof in Section 4). 

Let us continue with the two-parameters processes, where now means 

convergence in distribution in D([0, l] 2 ), and means convergence in dis- 
tribution of finite dimensional marginals. The different normalizations are 
explained in Remark 13.71 

We begin with a recall of the convergence in the deterministic model of 
truncation which was proved in Theorem 1.1 of [10J. 

Theorem 3.2. Under the Haar measure on U(n) or 0(n), 



In the sequel we will prove the following convergences for the second model 
of truncation. 

Theorem 3.3. (1) Under the DFT model, or more generally when \Uij\ 2 
1/n a.s. for every i,j, 



n 



-1/ 



/2 ( jin) _ E 7-(nA yy(oo) _ 



(2) Under the Haar measure on U(n) or O(n), 

n -l/2 (jin) _ E7-(n)^ yy(oo) 



The proof uses the decomposition (|2.6|) : 

7"(«) _ E7-(n) = n V 2 yy W + jjM . 

The following trivial lemma (consequence of the convergence of empirical 
processes and G^) rules the behavior of and the following Propo- 
sition [33] provides a convergence in distribution of hence a convergence 
in probability to for n _1//2 Z^ n \ 
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Lemma 3.4. 

W (n) ^ W°° . (3.3) 

Proposition 3.5. (1) Under the DFT model, or more generally if \Uij\ 2 - 
1/n a.s. for every i,j, 

(2) Under the Haar model, 

Z (n) 1^ ^pi w (co) +Bo ® Bof (3.4) 

where Bq ® I?o ffl^d l/f^ 00 ) are independent, and 

n ~l/2 z (n)^ Q (35) 

Since our studies have for origin the article of Chapuy [4] which proved 
(|3.6p on random permutations, we give now the complete behavior of the 
above statistics in this case. 

Theorem 3.6. Assume that U is a random permutation of [raj. We have 

n -l/2 W (n) w(oo) (36) 

1/2 f T (n)_ ET (n)\ fi ^ 



n 

n -l/2 2 (n) ^ (oo) _ (3g) 

Remark 3.7. Let M p ' q = U p ' q (U p ' q )* and M 8 '* = W'* (U 8 *)* . For s,t 
fixed, the random variables 7^" and are linear functionals of the em- 
pirical spectral distribution of M L ns J > L ni J and M s ' 1 respectively. For classical 
models in Random Matrix Theory, the convergence of fluctuations of such 
linear functionals do not need a normalizing factor, since the variance is 
bounded (the eigenvalues are repelling each other). Here, this is indeed the 
case for T^f (see [llj for the complete behavior for general tests functions) . 

But, in the case of iff , we have Var E[7^|-Ri, . . . , R n , la, ... , L n ] = 0(n), 
which demands a normalization. 

Remark 3.8. Going back to the decomposition It 2. 6]) . gathering the conver- 
gences in \3. 3]) fff.Tp and \3. 8\) and the covariances, we recover the identity 

B 0fi l = W°° + W°° 

where the two processes in the RHS are independent, fact which was quoted 
in [9] section 2. 

4. Proofs 

4.1. Proof of Theorem l3.H It is known that the vector (|C/ii| 2 , i = 1, . .., n) 
is distributed uniformly on the simplex. Actually, the result is a consequence 
of the following proposition. 
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Proposition 4.1. Let (iij, i = 1, . . . , n) Dirichlet (/?', ...,/?') distributed. 

(1) T/ie process « 1 ^ 2 {X]|'=i ( u « — V n ) > s ^ [0? 1]} converges in distri- 
bution to \J /3'~ 1 Bq. 

(2) TTie process n 1 / 2 !^™^ (1_r^< s — s) , s £ [0, 1]} converges in distri- 
bution to a/1 + /3 /_1 -B . 

Proof. It is easy to see that 

, \ law / gi En \ / a i\ 

ui, • • • , u n = ■ ■ , • • • , ■ ■ (4.1 

\gl H hgn gl H hgn/ 

with gj independent and gamma distributed with parameter f3' . 

(1) The above representation yields the following equality in law between 
processes 

\ns\ 



n 1 ' 2 



{ 5 " = { ^ n W) 



i=l 



where S^k = X^.=i Si- The WLLN and the Donsker's theorem gives the 
convergence to \J (3'~ 1 (B(s) — sB(l)) where B is the standard Brownian 
motion. 

(2) The process n 1 / 2 !^™!^ (Ir^s — s) , s S [0,1]} is an example of a 
weighted empirical process. We may apply the Theorem 1.1 of Koul and 
Ossiander [15] p. 544. Set v n , = nuj be the weights. Under the following 
conditions on the weights: 

(1) (n- 1 ^) 172 A 7 

(2) n -1 / 2 maxj v ra , — > 0, 

the process converges to the product of 7 by an independent Brownian 
bridge. For the first one, we start from the representation (|4.ip . apply the 
SLLN twice and get 



n ^ v 



For the second one, we have: 



1/2 max{gi,i = 1, . . . ,n} 



—i 11 1 / 

rt ' maxv n j = n 1 

i gi H h g n 

Appealing again to the SLLN, it is enough to prove that maxjgj, i = 1, . . . , n} 
o(ra 1//2 ) in probability, which is clear (this maximum is of order logn). □ 

4.2. Proof of Theorem 13.61 We notice that the double sums in the def- 
initions of 7~( n ) or i?( n ) are actually single sums where j = n(i) for ir a 
random permutation. Moreover, by exchangeability of the variables Cj, 
these single sums have (as processes in s, t) the same distributions as if tt 
was the identity permutation. In this case, we may apply the results of [IB] . 
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since n _1 / 2 [T (n) - ET (n) ] and n" 1 / 2 ^™) are precisely X% and Y n therein, 
respectively. 

4.3. Proof of Proposition 13. 5L Under the DFT model, we have = 
(g) G^ n \ so we have only to study the Haar model, in U(n) and in O(n). 
As long as possible, we keep the notation Uij for the generic element of 
U in both cases, but we use the notation Oij when we need to stress that 
the underlying unitary matrix is sampled according to the Haar measure in 
O(n). 

First, we prove (|3,4p . i.e. the convergence of finite dimensional distribu- 
tions, beginning with the 1-dimensional marginals since the notations are 
easier to follow for the reader, and then using the Cramer- Wold device. In 
a last part, we prove the tightness, using a criterion due to Davydov and 
Zitikis [8]. 

4.3.1. 1-Marginals. We first consider the convergence of the 1-marginals, i.e. 
for fixed s, t € [0, 1] , we show the convergence of Z^> (s, t) to \J fi'-^W^ (s, t)+ 
Bq <g) Bq(s, t). By a scaling property, this is equivalent to prove the weak 
2;( n )( s t) 

convergence of — ' towards \f /3'~ 1 iVi + N2N3 where iVj are 

y/s(l- S )t(l-t) 

independent standard Gaussian variables. According to the next Proposi- 
tion, whose proof is given in the Appendix, we can replace the independent 
centered random variables 

W:=^^,C^.= ^0Zl. i,j = l,.. .,„ 
by independent M(0, 1) random variables Xi and Yj. 

Proposition 4.2. Let (B4) and (Cj) two independent sequences of iid cen- 
tered variables, with variance 1, and finite third moment. Consider also (Xj) 
and (Yj) two independent sequences of iid standard Gaussian variables. We 
define A n = Y^i,j=i \Uij\ 2 RiCj and B n = Yhj=i \Uij\ 2 XiYj. Then, A n and 
B n have the same limiting distribution. 

We thus study the bilinear non symmetric form 

n 

E n := ^ ^ Xi\Uij\ 2 Yj 

built from the non symmetric matrix U = (|t^j| 2 )j < ■ This matrix is a 
so-called "doubly stochastic" matrix ([2], [7]). As a Markovian matrix, it 
has 1 as dominant eigenvalue. 

Proposition 4.3. The sequence of random variables 

n 

S n := ^ ^ Xi\U{j\ Yj 



RANDOM TRUNCATION AND BRIDGES 



converges in distribution towards \J (3' 1 A r i + N2N3 where N\,N2,N^ are 
independent and J\f(0, 1) distributed. 

Proof. We know that E|E^-| 2 = n~ l . We may remark that 

*».7 — 1 



and that 5' '= A^iVs- Set 



with 



S n '■— S n S n — ^ ^ XiVijYj 



*v = l^f-n" 1 - 



We will use the characteristic function and conditioning. Set (X, V) := 
< n,Vij,i,j < n). Conditionnally upon (X, V), the vector (S n ,S' n ) is 
Gaussian and 

n 

cov(S n ,S' n \X,V) = n- 1 XiXkVktcoviYjM) 

ij,k,e=i 

n n I n \ 

= n- 1 XiXkVkj = n" 1 Wk Y. Vk 3 = 

i,j,k=l i,k=l \j=l J 

since U is unitary. The variables S n and S' n are then independent, condi- 
tionnally upon (X,V). Moreover 



E[expi9S' n \(X,V)] = E[expWS' n \X] = exp-- ^ =1 l} (4.2) 

and 



E[ex V ieS n \(X,V)} = exp--S n (4.3) 

where 

S* n := ^ ( VijXi j = ^2 XiXjHij , 

j=l \i=l / ij=l 

with 

#ij = (VV*)ij , i,j = l,...,n. 

This implies 

Ql ( n \ 

Eexpi0£ n = Eexp-y [n' 1 (^Xtf + S n J (4.4) 
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Since (Yl Xi) 2 jn is distributed as N 2 (where N is J\f(0, 1)), if we prove that 

n / n \ 2 

S,:=E E^i A/?" 1 , (4-5) 
j=i \i=i J 

we will conclude that 

Eexpi(9S n ^Eexp-y(iV 2 + /3 / - 1 ) 



and this limit is the characteristic function of N 2 N% + \J j3' 1 N\. 
It is clear that (|4.5|) is implied by 

limES n = /3'- 1 (4.6) 

n 

limE(S n ) 2 = /3'- 2 . (4.7) 

n 

To prove these assertions, we will need some joint moments of elements 
of the matrix H, which are themselves affine functions of moments of the 
matrix U. 

The first limit in (|4.6p is easy to obtain since 



(4.8) 



ES n = EH a = nEH n = n 2 E(V u ) 2 = n 2 Var V n 

i 

\n 2 Var |On| 2 = 2 ^t^ in the orthogonal case, 



n 2 Var|C/n| 2 = ^q4 in the unitary case, 



n+2 
n+1 

where we refer to (|5.ip . 

To prove (|4.7p we expand S 2 : 



S 2 = XiXjXkXiHijHf-e ■ 

i,j,k,e=i 

In the expectation, the only non vanishing terms are obtained according to 
the decompostion 

^ = E + E + E + E ( 4 - 9 ) 

i=j,k=£y^i i=k,j=£j^i i=£,j=k^i i=j=k=£ 

= (l) + (2) + (3) + (4). (4.10) 
Since (EX 2 ) 2 = 1 and EXf = 3 we have successively 

(1) = n(n-l)E(H n H 22 ) (4.11) 

(2) = (3) = n(n-l)E(H 2 2 ) (4.12) 

(4) = 3nEiJf x . (4.13) 

Now 

Etf 2 ! = nET# + n(n - 1)E (4.14) 

E(H n H 22 ) = nE(V^Vl)+n(n-l)E(V^Vi 2 ) (4.15) 
E(tf 2 2 ) = nE + n{n - 1)E (^11^12^21^22) • (4.16) 
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this last equality coming from the symmetry of H. 
Prom Lemma 15.11 (see (|5.7j) with k = 4), 

E S 2 = n\n - 1) 2 E (VftV^) + 2n 2 (n - 1) 2 E (^11^12^21^22) + o(l). (4.17) 

and from Lemma 15.3} we conclude that (|4.7j> holds, which ends the proof of 
Ol □ 

4.4. Finite-dimensional marginals. We now consider the convergence of 
the finite dimensional distributions, following the same scheme of proof as in 
the case of the 1-dimensional marginal. The Lindeberg's strategy statement 
is now: 

Proposition 4.4. Let (ft(s)) and ('ji(s)) be two independent sequences of 
independent Brownian bridges. Define G n (s,t) = Ylij=i\Uij\ 2 /3i(s)"/j(t). 
Then, the processes Z n and G n have the same finite dimensional limiting 
distributions. 

We then consider the convergence of the finite-dimensional distributions 
of the process 

G n {s,t) = Y,\Ui j \ 2 (3 i {s)~ /j {t) = S n (s,t) + S' n (s,t) 

where S' n (s,t) = ± ^ ft(s) 7 ,-(i), S n (s,t) = Eij^A(a)7j(*) and (ft), 
(7j) are two independent sequences of independent bridges. 
Let (si, t±), . . . , (sk, tx) £ [0, l] 2 , ai,..., ax £ K and define 

if 

= ^ oiiG n (si,ti) := 5 n + 
1=1 

according to the decomposition of G n . Conditionnally to (V, ft,i < n), 
(5 n , S,fJ is a Gaussian vector with covariance 

K n 

cov(S,S'\(V,P)) = -J2 E «J«i'^(*0ft'(«i')cov( 7i (t,),7i'(<i')) 

Z,Z'=1 i,j,i',j'=l 
^ K n n 

= ^ azaz'ft(s/)ft'(sz')S'(^,^')X]^' 

71 l,l'=li,i'=l j=\ 
= 

where g(t,t') denotes the covariance of the Brownian bridge and we use that 
^2jVij = 0. Thus S n and S 1 ^ are conditionnally independent given (V, 0). 
Thus, 

E[expi^ n |(y,/3)] = E(E[expi^|(y,/3)]E[expi^ n |(y,/3)]) 
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^2 I K n n 

E[expiBS' n \(V,0)] = exp I -— ^ a f a,/p(t,, 



l,i'=l i=i i=i 

K 



law 



exp j -y I ^ aia v g(t h t v )l3(si)j3(si 
\U'=l 



and 



where 



£[expi0S n |(y,/3)] =exp( --S n 



K 



S n = ^2 a i a i'9(U,ti') ^2 Hi,kPi(si)PkM- 

1,1' =1 i,k=l 

We now prove the convergence in probability of S n to 

K 

L:=f3'~ l ^2 a l a l'9{si,s v )g{ti,t v ) . 
l,l'=l 

We have first 

K n 

E(S n ) = ^2 aiai>g{ti,ti')g(si, s;/) ^ E(g^) 
M'=i i=i 

and Ei E (^u) = nE(JTii) — »■ Now, 

K n 

S n = ^2 a i a i' a P a v'9{~tuti')g{tp-,t p i) 

l,l',p,p' = l i,j,k,q=l 

Hi k H jq (3i(si)f3 k (si')f3j(s p )f3 q (s p ') . 

Taking the expectation, non zero terms are obtained when the indexes 
i,j,k,q are equal 2 by 2. Moreover, the only non null contribution at the 
limit is given for the combination i = k,j=q, i^j from Lemma 15.31 We 
thus obtain that E(S^) converges to 

K 

/3'~ 2 ^2 a i a i' a p a p'9(U,ti')g(tp,t p i)g(si, si/)g(s p , s p i) = L 2 
i,1',p,p'=i 

proving the convergence in probability of S n as desired. Thus, we have 
obtained 

E(expi0E n ) — > E ( exp— y ( J2 m<x V g{t h t/O0(«i)0M + L 
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Since 



and 

32 



E Uxp-- \^aia v g(t u t v )P( ai )P{8 V )\\ = E |^exp i6 ^ aiB° <S> B°(si,ti) 

we have proved (|3.4p in the sense of the finite dimensional distributions. 

4.5. Tightness of n -1 / 2 'Z^ n \ For the tightness, owing to the structure of 
the process Z n we apply a criterion due to Davydov and Zitikis [8] which 
can be reformulated as follows. 

Theorem 4.5 (Davydov-Zitikis). Let £W be a sequence of stochastic pro- 
cesses in D([0, l) 2 such that ^ — » £ and ^(0) = 0. Assume that 

(1) there are constants a > j3 > 2, c G (0, oo) and a n \, swc/i i/iat /or 
all n > 1, we /iaue 

supE(ie (n) ( s + s ',t+o-e {n) ^,on) <cii( s ,t)ii^, (4.18) 

s',t' v 7 

whenever ||(s,t)|| > a n; where = max(|s|, \t\). 

(2) the process can be written as the difference of two coordinate- 
wise non- decreasing processes t;^ and ^ such that £^ satisfies 



sup max{^ n) (s, t + a n ) - £,f> (s, t) , (s + a n , t) - (s, t)} = o P (l) , 
s,t<l 

(4.19) 



i/ien the process converges weakly to £. 

We will prove the two following lemmas. 
Lemma 4.6. Z( n ) satisfies &4.18\ ). 

Lemma 4.7. For any sequence (b n ) with b n — > +00, b~ l Z^ n > satisfies j^. 19 ). 

Proof of Lemma 14.61 Let us compute the moment of order 6 of the in- 
crements. Let s,s',t,t' 6 [0,1] such that s + s' < l,t + t' < 1 and 
B =]s', s + s']x]t', t + t']. Recall that the increment of Z n over B is given by 

Z n (B) = Z n (s + s',t + t') - Z n (s + s', t') - Z n (s', t + t') + Z n (s', t'). 

It is easy to see that 

K(Z n (Bf) =E(Z n (s,tf). 
We have, using the notation B; L = (1(^< S ) — s) and B'- = (l(Cj<t) ~ : 

E(Z n (Bf)= Yl E (fll^iJ 2 )E(n^ fc )E(n^J. (4.20) 

ikdkik=l,.-.6 k=l k=l k=l 
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Since the Bi and B'- are independent and centered, in the RHS of the above 
equation, the non null term in the sum are obtained when the i/. (resp. the 
jk) are equal at least 2 by 2. We now use the following properties: for some 
C > 0, 

• |E(Sf)| < Cs for 2 < k < 6 

• |E((^) fc )| < Ct for 2 < k < 6 

• E(nLil^J 2 ) = 0(^r) (see 

It follows that (C may change from line to line) 

m n {Bf) <c (4 + 4 + 4 + 4 + 4^ + — + • ( 4 - 21 ) 

yra 4 n J n z n J n z n J 
Note that if s > - and i > -, 

E{Z n (Bf) < Cs 3 t 3 . (4.22) 
Define \\{s,t)\\ = sup(|s|, |*|). If ||(s,*)|| > ~, we can see from (|PT|) that 

E(Z n (B) 6 ) <C||( S ,i)l| 3 - (4.23) 

We now consider the increment Z n (s + s',t + i') — Z n (s',t'). It is easy to 
see that 

£„(s + a', t + - t') = Z n {B) + Z n (Bx) + Z n (B 2 ) 

where B x = [0, s'] x [If, t' + t] and B 2 = [s\ s' + s] x [0, £']. Therefore, 

E(|2 n ( s V,^+^0-2n(s^^0l 6 )<^(E(^n(5) 6 )+l(^n(Sl) 6 )+E(-^n(S2) 6 )). 

From (|4.2ip . we can see that in any case, for i = 1, 2, if ||(s, t)|| > ^ : 

E(Z„(^) 6 ) <C||( S ,t)|| 3 . 
For example, if s < ^ < t, 

E(Z n (5 2 ) 6 )) < <?4 < C«* 2 < Ct 3 . 
We have thus proved the following: if ||(s, > ^ 

E(\Z n (s + s',t + t') -Z n (s',t')\ 6 ) < C\\(s,t)\\ 3 , 
which is exactly (|4.18p with a n = n , a = 6 and /3 = 3. □ 

Proof of Lemma \4.7\ Indeed, we can write: 

Z n (s,t) = Ei(s,t) - E 2 (s,t) 

with 

Hl(s,t) = ^ WijfltiKalCjKt + St , H 2 (s,t) = Sy^l Ci <t + t'^lL l <s 

i,j j i 
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and both these processes are coordinate-wise non-decreasing. Let us now 
check (|U2)| with ^{ n) = E x /b n and ^ = E 2 /b n . We have to prove that for 
S = l/n, 

sup max{ H 2 (s, t + 5) - E 2 (s, t) , E 2 (s + 5, t) - E 2 (s, t)} = o P (b n ) . 

s,t<l 

(4.24) 

Owing to the symmetry of the roles played by Cj and Lj, we will focus on 
the first term in the above maximum. We have 

E 2 (s,t + 5) - E 2 (s,t) = s^ltKCjKt+S + 1 U<s 

j i 

< Yl^C^t+S + nS (4.25) 
j 

:= N n (t,l/n) + l. (4.26) 
We have to prove that, for every e > 0, 

P(sup7V n (t, 1/n) > eb n ) ^ 0. (4.27) 

t 

or, equivalently, 

P(sup N n (t, 1/n) > [sb n \ ) -+ . (4.28) 

t 

The event {sup 4 N n (t, 1/n) > L^nJ} means that there is a subinterval 
of [0,1] of length 1/n which contains at least \_eb n \ points of the sample 

(Ci, . . . , C n ). Denote by < Cpy • • • , < ^(nj) ^ e reor dered sample. 

We have (denoting Co = by convention) 

{sup 7V n (t, 1/n) > [eb n \} C {3 < k < n : C$ +M) - eg < 1/n} (4.29) 



hence, by the union bound 

, \ \ ^ \ " HE ( f 



P(sup JV ft (t, 1/n) > Le&nJ ) < E P f^tl Ke\) ~ C (k) ^ V") ■ ( 4 -30) 



fc 

It is well known that the spacings follow the Dirichlet distribution of pa- 
rameter (1,1,..., 1) so that 

n n law gl + • • • + gr I A o->\ 

k(fc+r) - <-(fe) = gl + . . . + gn ' 

where gj are i.i.d. and exponential. From ()4.30p . we get 

F(sup N n (t, 1/n) > [eb n \) < nP ( gl + ' ' ' + g[6n£j < < (4.32) 
t V gl H \~gn nj 

< nP(gi + • • • + g n > 2n) + nP(gi + • • • + g LbnE j < 2) . 



(4.34) 
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But we know (Chernov bound) that 

P(gi + • • • + gn > 2n) < exp -nh{2) (4.33) 
F(gi + • • • + g L 6„ £ j < 2) < exp - [b n e\ h 
where 

h(x) = x — 1 — log x . 
We conclude that (ETTOl) is fulfilled. □ 

5. Moments 

If U is Haar distributed matrix on U(n) or O Haar distributed on O(n), 
let us denote by u the generic element \Uij\ 2 or O?-. We know that u follows 
the beta distribution on [0, 1] with parameter (/?', (n — l)/3') so that 

E\Uij\ 2 = - , E\Uij\ 4 = - 2 , Var^f = (5.1) 
n n(n + 1) n 2 (n + l) 

E|0 M f = - , E|Cy 4 = 3 , Var \O id \ 2 = 2( " + 3) , (5.2) 
n n{n + 2) n^(n + 2) 

and more generally 

„, ,ob (n — l)\k\ 

E\U ii \ 2k = J —— 5.3) 

1 Jl (n-l + k)\ v ; 

E\0--\ 2k - f5 4) 

^™ ~ n(n + 2)...{n + k-2)- [bA) 

Lemma 5.1. Let fcsN, 

(1) For every choice of indices i = (ii, ... ,if.) and j = (ji, . . . ,jk), 

1 



E{\U hjl \ 2 ...\U ikj f)=0(^) (5.5) 



(2) For every choice of indices i = (ii, . . . , ik) and j = (ji, . . . ,jk), 

E{\O lin \ 2 ...\O tkJk \ 2 )=0(±) (5.6) 

(3) For every choice of indices i = . . . and j = (ji, ■ ■ ■ ,jk)> f or 
the unitary and orthogonal cases, 

Efe-W=0(i) (5.7) 

Proof. (1) follows from the Weingarten formula giving the moments of Haar 
unitary matrix coefficients (see [6], Corollary 2.4). 

E • • • ^iAi: • • • U h , H ) = £ *paf WgCn^oT 1 ) (5.8) 
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where 5? = 1 if i(s) = i(a(s)) for every s < k and otherwise, and the 
asymptotics of the Weingarten function (see [6], Proposition 2.6): 

Wg(n,cr) = 0{n- k -\ a \) (5.9) 

where \a\ = k — #(cr). Note that the maximal order is obtained for the 
identity permutation Id for which \Id\ = and in this case, Wg(n,Id) = 
n- k (l + o(l)). 

(2) follows first from the Weingarten formula giving the moments of Haar 
orthogonal matrix coefficients (see [6], Corollary 3.4). For every choice of 
indices i= . . . , i k , i Is . . . , ^) and j = {ji, ■ ■ ■ ,jk,h, ■ ■ ■ Jk), 

E (O hjl . . . O ikjh O hh . . . O h h ) = 6 i 16 ! 2 Wg° (n) (pi,P2) (5.10) 

Pl,P2&M 2 k 

where M 2 k denotes the set of pairings of [2k], Wg° (n) is the orthogonal 
Weingarten matrix and 5? 1 (resp. 5? 2 ) is equal to 1 or if i (resp. j) is 
constant on each pair of p\ (resp. p 2 ) or not. 

We then use asymptotics for the orthogonal Weingarten matrix (see |6], 
Theorem 3.13): 

Wg° (n) (pi,p 2 ) = 0{n- k ~ 1 ^^) 
for some metric I on Ai 2 k- 

(3) follows from (1), resp. (2), and the definition of V in terms of |t7| 2 . □. 

We now need to have precise asymptotics for some moments of U up to 
order 8. 

Proposition 5.2. (1) 

E(|[/n| 4 |^ 22 | 4 ) = - 4 +0(^) (5.11) 

E(|t/n| 2 |[/i 2 | 2 |[/ 2 i| 2 |[/22| 2 ) = -1 + 0{\) (5.12) 

H\Un\ 2 \U 22 \ 4 ) = ^ + 0(±) (5.13) 

E(l^ii| 2 |f/i2| 2 |^2 2 | 2 ) = ^ + 0{\) (5.14) 

(2) 

n\Oii\ i \0 22 \ A ) = - A +0{\) (5.15) 

E(|On| 2 |Oi 2 | 2 |0 2 i| 2 |0 22 | 2 ) = \ + 0{\) (5.16) 

E(|O n | 2 |0 22 | 4 ) = 4 + 0(^) (5-17) 
E(|O n | 2 |0 12 | 2 |0 22 | 2 ) = -i + 0{\) (5.18) 
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Proof. Let us first prove (1). We shall give the details for the first formula 
(|5.1ip . the other ones are similar. From (|5.8p and (I5.9p . the main contribu- 
tion in (|5.8p is given by the pairs (a, (3) for which /3a _1 = Id. We now give 
all the permutations a and f3 giving a non null contribution in (I5.8P for the 
computation of E(| ?7"n | 4 1 C/22 1 4 ) • Tha admissible permutations a are given 
by their cycle decomposition: 

ai=Id=(l),(2),(3),(4) 
a 2 = (12),(3),(4) 
a 3 = (l),(2),(34) 
a 4 = (12), (34) 

and the same for the f3. 

There are 4 pairs of permutations giving j3a~ l = Id, those corresponding to 
a = j3. Therefore, we obtain (|5.1ip . 

For (|5.12p . the admissible a are the same and the corresponding f3 are given 
by 

ft = Id =(1), (2), (3), (4) 
& = (13),(2),(4) 
/5 3 = (1),(3),(24) 
& = (13), (24). 

Thus, there is only one pair (ai,(5\) giving the main contribution. 
The proof for the moments of order 6 is similar. 

Let us now prove (2). We refer to the paper pQ. In particular, these 
authors define 

I n (l fj = E(O^O b 12 O c 21 Ol 2 ) , I n -x(a,b) = E(O a u O b 12 ). (5.19) 
In the sequel we denote 

m\\ = (m — l)(m — 3) . . . 

We need 

I ( A °) I ( 2 2 ) I ( 2 °\ I ( 2 2 

The first and the third are ruled by Theorem C therein, so that 

(2 0\ n!2!!4!!(n + 4)!! _ 3(n + 3) _ 3_ J_ 

"^0 4j ~ n !!(n + 2)!!(n + 5)!! ~ n(n - l)(n + 2)(n + 4) ~ n 3 + ^n 4> 
and 



In 



4 0\ n!4!!4!!(n + 6)!! 



y (n + 2)!!(n + 2)!!(n + 7)!! 



9( n + 3)( n + 5 ) 9 + 1 



(n + 6)(n + 4)(n + 2)(n + l)n(n - 1) n 4 n 5 ' 
Now, from Definition 3.1 therein 

In {I ^=[1^(2,2)^(1 2 
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and from Theorem 6.2 



(2 2\ n — 1)!! 
® \2 2 J = ( re - 2)!! + UQ1 + Ul ° + 



with 



(n + 6)!! (n + 4)!! ,(" + 2)!! 

(n + 7)!! (n + 7)!! (n + 7)!! 

Now, from Theorem 1.2 therein 
which gives 

'2 2\ (n 2 + 4n + 7) J_ J_ 

^2 2) ~ ( n + 6)(n + 4)(n + 2)(n + l)n(n- 1) ~ n 4 + ^ 

The last one is managed with the same scheme: 

/n( 2 ^ = ^(2,2)^ (2) *(2 2^j 

and from Theorem 4.3 therein 

(2 2\_ (n-l)!![(n + 2)!!] 2 
\0 2) ~ (n-2)!!n!!(n + 5)!! 

and since / n _i(2) = E0 2 x = - we have got 

(2 2\ _ (n-l)!!(n + 2)!! _ (n+1) _ 1 1 
^0 2,1 n(n + 2)!!n!!(n + 5)!! n 2 (n + 4)(n + 2) n 3 V j 

□ 

Lemma 5.3. (1) In the unitary case 

lim n 2 (n - 1) 2 E = 1 (5.20) 

n— s-oo 

erne/ 

lim n 2 (n - 1) 2 E (^11^12^21^22) = (5.21) 

n-s>oo 

(2) In the orthogonal case 

lim r?(n - 1) 2 E = 4 (5.22) 

and 



lim n 2 (n - 1) 2 E (^11^12^21^22) = (5.23) 

n— >oo 



Proof. We develop 



^22 = (Wiif - Vnl 2 + ^)(|f/ 22 | 4 - -\u 22 \ 2 + \) 
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to obtain 

nvAvl) = E(|*7 n | 4 |£/d 4 ) - -E(l^ii| 4 |C/22| 2 ) + ^n\Uu\ 4 ) 

+±E(|[/ U | 2 |[/ 22 | 2 ) -^E(\U U \ 2 ) + \ . (5-24) 

in the unitary case and the same expression with Uij replaced by Oij in the 
orthogonal case. It remains to make the substitutions from Proposition 15.21 
The proof of the second limit is similar. □ 

6. Appendix : Proofs of Theorem 14.21 and Theorem 14.41 

We define D n = \Uij\ 2 RiYj and we prove that A n and D n have the 
same limit in law. We can write 

n n 

A n := ^j{n)Cj := 5 n _i + A n (n)C n , D n := ^ Aj(n)Yj 

where Aj(n) = Yl?=i\Uij\ 2 Ri are independent of Cj and Yj. Let F be a 
smooth function with a bounded third derivative. The first step consists in 
replacing C n by Y n and to compare E(F(5 n _i + A n (n)C n )) and K(F(S n -i + 
A n (n)Y n )). Using a Taylor expansion, 

E(F(5 n _i + A n (n)C n )) = E(F(5 n _ x )) + E(F'(S n ^)) E(A n (n)) E(C n ) 

+ l - E(F"(5 n _i)) E(A n (n) 2 ) E(C 2 ) + 0(E(A n (n) 3 )) E(C 3 ) 

and a similar expression for ~E(F(S n -i + A n (n)Y n )). All the terms in the two 
expressions, but the last, are equal. We thus need to estimate E(A n (n) 3 ) 
By centering, 

E(A„(n) 3 ) = f>(|[y in | 6 ) E(Rf) = O(l) (6.1) 

from (15. 3p or (15. 4p . We repeat the operation of replacement of Cj by Yj 
from j = n — 1 to 1 and by a summation, we obtain 

E(F(A n ))-E(F(D n )) = 0(-). 

n 

In the same way D n and -B n have the same limit in law, by exchanging the 
role of i and j. 

We can extend this proof for the finite dimensional distributions. The 
proof is the analogue as above, using a Taylor expansion of the two expres- 
sions, involving a smooth function F of k variables, 

F(S' n _i(si,ti)+A n (si)(l (c . n < <l) -ti), . . . ,S , „_i(s fe ,t fc )+A n (s i .)(l (ai < tfc) -t fc )) 
and 

F(S n ^ 1 (s 1 ,t 1 ) +A n (si)7„(ti),... ySn-^s^tk) + A n (s k )j n {t k )). 

We then use that the process (lfQ.<t) — t) has the same covariance as the 
Brownian bridge and the estimate (|6.1j) ,D 
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